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(57) 



ABSTRACT 



In methods and apparatus for echo cancellation, a noise 
estimate is selectively added to a downlink communications 
signals to improve the convergence speed and stability of an 
echo-canceling adaptive filter. The added signal content 
enables the echo-canceling adaptive filter to more quickly 
track echo path changes during user communications and 
prevents divergence of the adaptive filter during periods in 
which the downlink signal does not contain information 
(e.g., speech) sufficient to develop a quality echo path 
estimate. To prevent performance degradation from a user 
perspective, the added noise is made to resemble existing 
system noise (e.g., either near-end or far-end background 
noise). In exemplary embodiments, filter convergence speed 
and stability are further improved by whitening the spectrum 
and increasing the power level of the added noise whenever 
a near-end user is active and masking the added noise. 

25 Claims, 4 Drawing Sheets 
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METHODS AND APPARATUS FOR far-end user is not speaking (i.e., when the energy content of 

IMPROVING ECHO SUPPRESSION IN the loudspeaker signal is insufficient to provide a basis for 

BI-DIRECTIONAL COMMUNICATIONS developing a quality echo estimate). Consequently, signifi- 

SYSTEMS cant residual echo can also be temporarily passed back to the 

5 far-end user each time the far-end user begins to speak after 
having been silent for a period of time. 
FIELD OF THE INVENTION Jo rcduce ccho immediatcly following first 
The present invention relates to communications systems, installation of a device in a new and acoustically unknown 
and more particularly, to echo suppression in bi-directional environment, conventional systems often employ an initial- 
communications systems. 10 ization sequence to train the echo canceler before the device 

is used for actual communications. Specifically, an artificial 

BACKGROUND OF THE INVENTION audio signal (typically white noise) is played through the 

Adaptive filtering arrangements are prevalent in commu- loudspeaker, and the echo canceler is given time to converge 

nications systems of today. Such arrangements are typically t0 me new echo P alh P rior to a first cal1 bein S made or 

used to reduce or remove unwanted signal components 15 received. However, such an approach does not address the 

and/or to control or enhance signal components of interest. above described problems associated with slow filter recov- 

A common example of such a filtering arrangement er ? ^ e Rowing changes in the echo path or following 

relates to hands-free telephony, wherein the built-in ear- ? cnods °J ***** us f sdcncc ' V 1 ™' mc f 15 \ nced for 

phone and microphone of a conventional telephone handset on methods and a PP aratus for P r0Vldm S echo cance * 

are replaced with an external loudspeaker and an external 2 ° latl0D m communications systems, 

microphone, respectively, so that the telephone user can SUMMARY OF THE INVENTION 
converse without having to physically hold the telephone 

unit in hand. Since sound emanating from the external The present invention fulfills the above -described and 
loudspeaker can be picked up by the external microphone, ^ other needs by providing echo canceling techniques wherein 
adaptive filtering is commonly performed in order to prevent a noise signal is selectively added to a downlink (e.g., 
the loudspeaker output from echoing back and annoying the loudspeaker) signal to thereby improve adaptive filter con- 
far-end user at the other end of the conversation. This type vergence speed. Advantageously, the added signal content 
of adaptive filtering, or echo canceling, has become a basic enables an echo -canceling adaptive filter to more quickly 
feature of the full-duplex, hands-free communications 3Q track echo path changes during user communications, 
devices of today. Moreover, the added noise prevents divergence of the adap- 
TVpicaUy, echo cancelation is achieved by passing the tive filter during periods in which the downlink signal does 
loudspeaker signal through an adaptive Finite Impulse not contain information (e.g., far-end speech) sufficient for 
Response (FIR) filter which approximates, or models, the developing a good estimate of the echo path, 
acoustic echo path between the hands-free loudspeaker and 35 To prevent performance degradation from a user 
the hands-free microphone (e.g., a passenger cabin in an perspective, the added noise can be made to resemble 
automobile hands-free telephony application). The FIR filter existing system noise (e.g., either near-end or far-end back- 
thus provides an echo estimate which can be removed from ground noise). For example, the power spectrum and level of 
the microphone output signal prior to transmission to the existing near-end noise can be estimated in real-time, and 
far-end user. The filtering characteristic (i.e., the set of FIR 40 the added noise can be generated having a similar or 
coefficients) of the adaptive FIR filter is dynamically and identical spectrum and a somewhat lower power level. Thus, 
continuously adjusted, based on both the loudspeaker input a system constructed according to the invention can provide 
and the echo-canceled microphone output, to provide a close the benefits of enhanced filter convergence speed without 
approximation to the echo path and to track changes in the creating user-perceptible differences in overall system 
echo path (e.g., when a near-end user of an automobile 45 operation. 

hands-free telephone shifts position within the passenger Advantageously, filter adaptation speed can be further 

cabin). enhanced according to the invention by modifying charac- 

Adjustment of the filtering characteristic is commonly teristics of the added noise at appropriate times. For 
achieved using a form of the well known Least Mean Square example, the spectrum of the added noise can be spread, and 
(LMS) adaptation algorithm developed by Widrow and Huff 50 the power level of the added noise can be increased, when- 
in 1960. The LMS algorithm is a least square stochastic ever the near-end user of a handsfree telephone is speaking 
gradient step method which, as it is both efficient and robust, (and therefore effectively masking the added noise). Whit- 
is often used in many real-time applications. The LMS ening and strengthening the added noise signal enables the 
algorithm and its well known variations (e.g., the Normal- adaptive echo -canceler to better identify the true echo path, 
ized LMS, or NLMS algorithm) do have certain drawbacks, 55 and doing so only when the near-end user is speaking results 
however. For example, the LMS and other known algo- in no performance degradation (i.e., no user-perceptible 
rithms can sometimes be slow to converge (i.e., approach the changes in overall system operation), 
target filtering characteristic, such as the acoustic echo path An exemplary communications device according to the 
in a hands-free telephony application), particularly when the invention includes an adaptive echo canceler receiving a 
algorithm is adapted, or trained, based on a non- white, or eo near-end audio signal and providing an echo-canceled near- 
colored, input signal. end signal for transmission to a far-end user via a commu- 

As a result, echo cancelers utilizing the LMS or other nications channel, adaptive filtering coefficients of the adap- 

adaptive algorithms can temporarily allow significant tive echo canceler being dynamically adjusted in 

residual echo to pass back to a far-end user whenever the dependence upon the echo-canceled near-end signal and 

true echo path is changing or unknown (e.g., upon first 65 upon a reference signal. The exemplary communications 

installation of a handsfree device). Moreover, known adap- device further includes a noise estimation processor receiv- 

tive algorithms tend to diverge during periods in which the ing a far-end audio signal via the communications channel 
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and providing the reference signal to the adaptive echo BRIEF DESCRIPTION OF THE DRAWINGS 

canceler, the noise estimation processor producing the ref- „ T _ „ . . , - 

i .■ i j/ • . «. r j j* FIG. 1 is a block diaeram of an exemplary hands-free 

erence signal by selectively adding noise to the far-end audio , , s • , . 

, r , .. J • telephony system incorporating a conventional echo cancel- 
signal. For example, the noise estimation processor can F 3 J v & 

include a voice activity detector for determining whether the 5 m ^ arrangement. 

far-end audio signal includes speech of the far-end user, and FIG 2 ^ a block diagram of an exemplary hands-free 

can thus add noise to the fai-end audio signal only during telephony system incorporating an echo canceling arrange- 

periods in which the far-end audio signal does not include ment according to the invention. 

speech of the far-end user. FIG. 3 is a flow diagram depicting steps in an exemplary 

To prevent degradation of system performance from the 10 method of echo cancellation according to the invention, 

user perspective, the noise added to the far-end audio signal FIG. 4 is a is a block diagram of an exemplary telephony 

can be an estimate of noise present in one of the near-end system incorporating an echo canceling arrangement 

and far-end environments of the device. Additionally, a level according to the invention, 
of the noise added to the far-end audio signal can be made 

slightly less than an estimated level of the noise present in is DETAILED DESCRIPTION OF THE 

the near-end environment. INVENTION 

To further improve adaptive echo canceler performance, FIG. 1 depicts bi-directional communications device 100 

the noise estimation processor can include a voice activity including a conventional echo -canceling arrangement. As 

detector for determining whether the near-end audio signal shown, the system 100 includes a microphone 110 , a sum- 

includes speech of the near-end user. Thus, the noise esti- 20 ming d evice 120, an adaptive filter 130, a filter computation 

mation processor can modify the noise added to the far-end processor i4Ufa speech encoder 150, a speech decoder 160, 

audio signal when the near-end audio signal includes speech and a loudspeaker 170. Those of ordinary skill in the art will 

of the near-end user. For example, the noise added to the appreciate that the below described functionality of the 

far-end audio signal can be whitened when the near-end components of FIG. 1 can be implemented using a variety of 

audio signal includes speech of the near-end user. 25 known hardware configurations, including a general purpose 

Additionally, a power level of the noise added to the far-end digital computer, standard digital signal processing 

audio signal can be increased when the near-end audio signal components, and/or one or more application-specific inte- 

includes speech of the near-end user. grated circuits (ASICs). Those of ordinary skill will also 

An exemplary method of echo suppression according to appreciate that, in practice, the exemplary system 100 

the invention includes the steps of filtering a near-end audio includes components (e.g., an analog -to- digital conv erter at 

signal to provide an echo-canceled near-end signal for Jbg^gu tput of the microphone 11 0 and a digital-to-analog 

transmission to a far-end user via a communications conv erter"" at the inp ut in the loudspeaker 170) which are 

channel, dynamically adjusting filtering coefficients used in omitteTfrom FIG. lTas they are not critical to an under- 

the filtering step in dependence upon the echo -canceled 35 standing of the present invention. 

near-end signal and upon a reference signal, and selectively In FIG. 1, an audio output of the microphone 110 is 

adding noise to a far-end audio signal to provide the refer- coupled to an additive input of the summing device 120, and 

ence signal used in the adjusting step. The step of selectively an output of the summing device 120 is coupled to an input 

adding noise can, for example, include the steps of deter- of the speech encoder 150 and to an input of the filter 

mining whether the far-end audio signal includes speech of 4Q computation processor 140. Additionally, an output of the 

the far-end user, and adding noise to the far-end audio signal speech decoder 160 is coupled to a second input of the filter 

only during periods in which the far-end audio signal does computation processor 140 and to an input of each of the m 

not include far-end user speech. adaptive filter 130 and the loudspeaker 170. An output of the 

Additionally, the step of adding noise to the far-end audio filler computation processor 140 is coupled to a control input, 

signal can include the steps of estimating noise present in 45 of the adaptive filter 130, and an output of the adaptive filter 

one of a near-end and a far-end environment, and adding 130 is coupled to a subtractive input of the summing device^ 

noise which is audibly similar to the estimated noise to the 120. 

far-end audio signal. Alevel of the noise added to the far-end In operation, a coded far-end audio signal, including 

audio signal can, for example, be made slightly less than an speech of a far-end user (not shown), is decoded via the 

estimated level of the noise present in the near-end envi- 50 decoder 160 and input to the loudspeaker 170 for presenta- 

ronment. lion to a near-end user (also not shown). The loudspeaker 

The exemplary method can further include the steps of output is then echoed back to the microphone 110 via an 

determining whether the near-end audio signal includes unknown and sometimes changing echo path, as is indicated 

speech of the near-end user, and modifying the noise added by a variable transfer function H(z) in FIG. 1 . Thus, audio 

to the far-end audio signal when the near-end audio signal 55 output trom the micropnone liu includes loudspeaker echo, 

includes speech of the near-end user. For example, the added i as well as near-end use r speech and near-end background 

noise can be whitened when the near-end audio signal noise_. 

includes speech of the near-end user. Additionally, a power To prevent the loudspeaker echo from reaching' and 
level of the added noise can be increased when the near-end annoying the far-end user, the FIR filter 130 filters the 
audio signal includes speech of the near-end user. go loudspeaker signal to provide an estimate of the loudspeaker 
The above-described and other features and advantages of echo received at the microphone 110, and the resulting echo 
the invention are explained in detail hereinafter with refer- estimate is subtracted from the microphone output via the 
ence to the illustrative examples shown in the accompanying summing device 120. Echo-canceled output from the sum- 
drawings. Those of skill in the art will appreciate that the ming device 120 is then encoded via the encoder 150 and 
described embodiments are provided for purposes of illus- 65 transmitted to the far-end user. 

tration and understanding and that numerous equivalent At the same time, the echo-canceled output is fed back to 

embodiments are contemplated herein. the filter computation processor 140 for use in adapting the 
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filter coefficients, or laps, of the FIR filter 130 such that they 
converge toward and track the true echo path H(z). As is well 
known in the art, the filter computation processor 140 
computes filter coefficient updates based on both the echo- 
canceled output, or error, signal and the loudspeaker input, 
or reference, signal (e.g., using an LMS or NLMS 
algorithm). As is also well known in the art, coefficient 
updates can be computed either on a sample by sample basis 
or on a sample block by sample block basis. 

Ideally the arrangement of FIG. 1 provides a quality echo 
estimate, and the far-end user is not distracted by delayed 
echoes of his or her own voice. However, as is described in 
the above Background of the Invention, the adaptive algo 
rithm (i.e., the algorithm implemented in the filter compu 



10 



whitening the added noise (and thus the reference signal) 
only when the near-end user is speaking, identification of the 
echo path is improved without degrading overall system 
performance. 

FIG. 2 depicts a bi-directional communications system 
200 in which the above described aspects of the invention 
can be incorporated. As shown, the exemplary system 200 
includes a noise estimation processor 210_ and a second 
summing device 220,-as well as the microphone 110, the first 
summing device 120, the adaptive filter 130, the filter 
computation processor 140, the speech encoder 150, the 
speech decoder 160, and the loudspeaker 170 of FIG. 1. As 
with FIG. 1, those of ordinary skill in the art will appreciate 
that the below described functionality of the components of 
FIG. 2 can be implemented using a variety of known 



iaiw • i- ut \ c k & can oe imi 

tation processor 140) can require a non-neghgible amount of 35 , , ^ \ . . , .. , 

. t , 5 l / hardware configurations, including a general purpose digital 

time to adjust to changes in the near-end echo path (e.g., ** ' & ° r r ° 

when a handsfree microphone is repositioned or when the 

near-end user moves). Moreover, the far-end audio signal 

can at times (e.g., during periods when the far-end user is 



computer, standard digital signal processing components, 
and one or more ASICs, In practice, the exemplary system 
200 includes components (e.g., an analog-to-digital con- 



*i a • i j '• • * » '„ * . «l j , * m verter at the output of the microphone 110 and adigital-to- 

silcnt) include insufficient content to prevent the adaptive ™ an . Mnw * • lt , rt L 



algorithm from diverging away from the true echo path. As 
a result, significant residual echo can reach the far-end user 
during and immediately following periods in which the echo 
path is changing and/or the far-end user is not speaking. 

According to the present invention, however, noise is 
added to the loudspeaker signal during communications to 
improve adaptive filter convergence speed and thereby 
reduce the residual echo experienced by the far-end user, 
even during and immediately following periods of echo path 
change and/or far-end user silence. Adding noise content to 
the loudspeaker signal enables the adaptive filter to more 
quickly track echo path changes and prevents divergence of 
the adaptive filter during periods in which the far-end user 
is not speaking. In other words, by adding noise to the 
loudspeaker signal, more energy is directed through the 
near-end echo path, thereby providing a stronger basis for 
developing and maintaining an accurate filtering character- 
istic (and therefore a more accurate echo estimate). 

To prevent performance degradation from a user perspec- 
tive (i.e., to prevent either user from hearing and being 
distracted by the additional noise), the added noise can be 
made to resemble already-existing system noise. For 
example, the power spectrum and level of noise present in 
either the near-end or far-end environment can be estimated 
in real-time (using known estimation techniques), and the 
noise to be added to the loudspeaker signal can be generated 
having a similar or identical spectrum and a somewhat lower 
power level. Consequently, neither user will identify the 
loudspeaker output as a noise source. 

To further improve adaptive filter convergence speed, the 
spectrum of the generated noise can be whitened, or spread, 
before the noise is added to the loudspeaker signal to provide 
the filter reference signal. Whitening the noise signal (and 
thus the reference signal) serves to excite the adaptive 
algorithm over a broader signal space and thus results in 
improved filter performance. Whitening can be 
accomplished, for example, via forward prediction-error 
filtering, as is well known in the art. See, for example, Simon 
Haykin, Adaptive Filter Theory, Third Edition, Prentice Hall 
Information and System Science Series, 1996, 

So that the users do not perceive a difference between the 
added noise and the existing system noise, the whitening can 
be performed, for example, only when the near-end user is 



analog converter at the input to the loudspeaker 170) which 
are omitted from FIG. 2, as they are not critical to an 
understanding of the present invention. 

In FIG. 2, an audio output of the microphone 110 is 
25 coupled to the additive input of the first summing device 120 
and to a first input of the noise estimation processor 210. . 
Output of the first summing device 120 is coupled to the 
input of the speech encoder 150 and to the first input of the 
filter computation processor 140. The output of the speech 
30 decoder 160 is coupled to a first additive input of the second 
summing device 220 and to a second input of the noise 
estimation processor^ 10. An output of the noise estimation 
processor 210 is coupled to a second additive input of the 
second summing device 220, and an output of the second 
35 summing device 220 is coupled to the reference inputs of the 
adaptive filter 130 and the filter computation processor 140. 
Output of the filter computation processor 140 is coupled to 
the control input of the adaptive filter 130, and the echo 
estimate output of the adaptive filter 130 is coupled to the 
40 sub tractive input of the first summing device 120. 

Generally, operation of the system 200 of FIG. 2 is similar 
to that of the system 100 of FIG* 1. In other words, the FIR 
filter 130 and the filter compulation processor 140 operate to 
provide an echo estimate which is subtracted from the 
45 mic rophone output signal to provide an echo-canceled near- 
ena audio signal to the far-end user. Unlike the system 100 
oi tiG. 1, however, the far-end audio signal is not fed 
directly to the loudspeaker 170 or to the reference inputs of 
the adaptive filter 130 and the filter computation processor 
50 140. Instead, the noise estimation processor 210 generates a 
noise signal, in dependence upon the near-end and far-end 
audio signals, which is added to the far-end audio signal (via 
the summing device 220) to provide the reference, or 
training, signal to the adaptive filter 130 and the filter 
55 computation processor 140. Consequently, the noise estima- 
tion processor 210 can provide the above described advan- 
tages relating to improved filter convergence speed without 
performance degradation. 

For example, the noise estimation processor can include 
60 well known voice activity detectors (VADs) to determine 
when the near-end and far-end users are speaking. Thus, an 
estimate of either the near-end or far-end background noise 
can be updated (using known estimation techniques) when 
the near-end user or the far-end user is silent, respectively. 



speaking and therefore audibly masking the added noise. 65 Further, modeled noise (e.g., noise resembling either the 
Indeed, when the near-end user is speaking, the added noise near-end or far-end background noise) can be added to the 
can be increased in power as well. By strengthening and far-end audio signal only when the far-end user is silent 
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(e.g., in situations where the far-end speech provides a We claim: 

sufficient filter training signal and enhancement is therefore 1. A communications device for providing bi-directional 
desired only when far-end speech is absent). Additionally, audio communications between a near-end user and a far- 
the added noise can be whitened and strengthened, as end user via a bi-directional communications channel, corn- 
described above, whenever the near-end user is speaking s pnsing: 

(and effectively masking the added noise signal). an adaptive echo canceler receiving a near-end audio 

_ „ , . , . , „ M „ signal and providing an echo-canceled near-end signal 

FIG. 3 depicts steps in an exemplary method 300 for for lransmission t0 lhe far ^ nd ^ ^ me communi- 

providing an adaptive filter reference signal according to the cations channel) wherein adaptive filtering coefficients 

invention. The exemplary method 300 can be implemented, ^ 0 f sa j d adaptive echo canceler are dynamically adjusted 

for example, in blockwise fashion (i.e., executed once for m dependence upon the echo-canceled near-end signal 

each of a succession of audio signal sample blocks) within and upon a reference signal; and 

the noise estimation processor 210 of FIG. 2. a 0Q i<# estimation processor receiving a far-end audio 

In FIG. 3, the method 300 begins at step 305, and a via the communications channel and providing 

determination is made at step 310 as to whether the far-end 1S *e reference signal to said adaptive echo canceler, 

user is speaking. If so, then a determination is made at step w ^ Tcm said °°J*? estimation processor produces the 

325 as to whether the near-end user is also speaking. If the * ference b \ «lc^vdy adding noise to the 

both the far^nd and ncar-cad users are speaking then no 2 r c VmumcSs device according to claim 1, 

noise is added to the loudspeaker signal m providing the wherein gaid noise estimation processor in * ludes a voice 

filter reference signal (step 340), and the method 300 ends 20 activi detector fof detcrminmg whcthcr lhe far _ end ai|dio 

at step 355. However, if the far-end user is speaking and the &ignal inc]udes speech of the far . end ^ and wherein said 

near-end user is silent, then the near-end noise estimate is estimation processor adds noise to the far-end audio 

updated at step 330, but still nothing is added to the signal only during periods in which the far-end audio signal 

loudspeaker signal in providing the reference signal (step doe s not include speech of the far-end user. 

340) before the method 300 ends at step 355. 2 5 3. A communications device according to claim 1, 

If it is determined at step 310 that the far-end user is not wherein the near-end user and the far-end user are situated 

speaking, then the far-end noise estimate is updated at step m a near-end environment and a far-end environment, 

315, and a determination is made at step 320 as to whether respectively, and wherein the noise added to the far-end 

the near-end user alone is speaking. If the near-erid user is audio si g nal * estimate of noise present in one of the 

also silent, then the near-end noise estimate is also updated 30 near-end and far-end environments, 

at step 335, and an unmodified (i.e., unwhitened and * A communications device according to claim 3, 

unstrengthened) noise signal resembling the near-end noise wherein a level of the noise added to the far-end audio signal 

or the far-end noise (or a combination thereof) is added to & less 10311 estimated level of the noise present in the 

the loudspeaker signal in providing the adaptive filter ref- near-end environment. 

erence signal (step 345) before the method ends at step 355. 35 5 - A communications device according to claim 1, 

However, if it is determined at step 320 that the near-end wherein said noise estimation processor includes a voice 

user alone is speaking, then a strengthened and whitened activity detector for determining whether the near-end audio 

noise signal resembling one or both of the near-end and si g na3 includes speech of the near-end user, and wherein the 

far-end background noise is added to the loudspeaker signal noise added to the far-end audio signal is modified when the 

in providing the reference signal (step 350) before the 40 near-end audio signal includes speech of the near-end user, 

method 300 ends at step 355. 6 * A communications device according to claim 5, 

. wherein the noise added to the far-end audio signal is 

Generally, the invention provides methods and apparatus whitened when me near . end audio signal includes speech of 

for improving echo canceler convergence speed and stabil- ^ near-end user 

ity. By selectively adding noise to a downlink signal during 4J 7 A conniptions devic6 according to dairn 5, 

communications, echo cancelation is improved without wherein a r leve , of ^ noise added , Q me faMnd audio 

degrading system performance from a aser perspective. The si , Js increased when the near . end audio signal includes 

benefits provided by the techniques of the invention are h of me near<nd ^ 

particularly useful in contexts where the far-end environ- 8 A communications device accoro jng tb claim 1, 

merit is quiet (i.e., where the far-end signal includes rela- J0 wherein said ad Uw ecnQ canceler is an acoustic ^ 

Uvely little content upon which a quality echo estimate can canceler> wherein (he near . en(J audio signa , is , ne „. cnd 

be based), microphone output signal, and wherein the reference signal 

Those skilled in the art will appreciate that the present . is used to feed a near-end loudspeaker, 

invention is not limited to the specific exemplary embodi- 9. A communications device according to claim 1, 

ments which have been described herein for purposes of 55 wherein said adaptive echo canceler is a network echo 

illustration and that numerous alternative embodiments are canceler, wherein the near-end audio signal is output from a 

also contemplated. For example, although the exemplary hybrid junction, and wherein the reference signal is input to 

embodiments have been described with respect to acoustic the hybrid junction. 

echo cancelation in the context of hands-free telephony, the 10. A communications device according to claim 1, 

disclosed adaptation-enhancement techniques are equally 60 wherein the filtering coefficients of said adaptive echo 

applicable to network echo cancelation (i.e., where echoes canceler are adjusted according to a Least Mean Squares 

result from impedance mismatches at a hybrid junction (LMS) algorithm. 

between a digital device and an analog network). The scope 11. A method for suppressing echo in a bi-directional 

of the invention is therefore defined by the claims appended communications device, the device being configured to 

hereto, rather than the foregoing description, and all equiva- 65 provide two-way audio communications between a near-end 

lents consistent with the meaning of the claims are intended user and a far-end user via a bi-directional communications 

to be embraced therein. chanoel, the method comprising the steps of: 
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filtering a near-end audio signal to provide an echo- an adaptive echo canceler receiving the near-end audio 

canceled near-end signal for transmission to the far-end signal and providing an echo-canceled near-end signal 

user via the communications channel; f or transmission to a far-end user via a communications 

dynamically adjusting filtering coefficients used in said channel, adaptive filtering coefficients of said adaptive 

filtering step in dependence upon the echo-canceled 5 ecnQ cance j er being dynamically adjusted in depen- 

near-end signal and upon a reference signal; and dence ^ echo-canceled near-end signal and upon 

selectively adding noise to a far-end audio signal to a reference signal; 

provide the reference signal used in said adjusting step. . t . t . - j j- 

-hi * .t_j j- * i • ii i- ■ + j . r r a noise estimation processor receiving a far-end audio 

12. A method according to claim 11, wherem said step of - , . « . • 1 
selectively adding noise comprises the steps of: 10 a en al v,a communications channel and providing 

. 4 .. , , tL f . . i • i j the reference signal to said adaptive echo canceler, 

determining whether the far-end audio signal includes r ... 

speech of the far-end user; and wherein said noise estunatl0n Processor produces the 

, ,. j • , I.- reference signal by selectively adding noise to the 

adding noise to the far-end audio signal only during ° . ' ' ° 

periods in which the far-end audio signal does not ]5 far " end audA0 S1 &* 1 > and 

include speech of the far-end user. a loudspeaker receiving the reference signal and provid- 

13. A method according to claim 11, wherein the near-end ing sound to the near-end environment. 

user and the far-end user are situated in a near-end environ- 20. A handsfree telephone according to claim 19, wherein 

ment and a far-end environment, respectively, and wherein sa j d noise estimation processor includes a voice activity 

said step of adding noise to the far-end audio signal includes 20 detector determining whether the far-end audio signal 

the steps of estimating noise present in one of the near-end includes speech of the far-end user, and wherein said noise 

and far-end environments and adding noise which is audibly estimation processor adds noise to the far-end audio signal 

similar to the estimated noise to the far-end audio signal Qal ^ riods m which ^ far . end ^ s[ y does 

14. Amethod according to claim 11, wherein a level of the not include Q of the far ^ nd ^ 

noise added to the far-end audio signal is less than an n< A , , c < 1 1. j- < , ■ in u 

. P , . 23 21. A handsfree telephone according to claim 19, wherein 

estimated level of the noise present in the near-end envi- 4l _ . . . % . . . . . c 

r the noise added to the far-end audio signal is an estimate of 

ronment. . , f , 

15. A method according to claim 11, further comprising noise P resent in one of ^ near ' eDd environment and a 
the steps of: f*** 0 * environment. 

, . . . . , . . . , , 22. A handsfree telephone according to claim 19, wherein 

determinmg whether the near-end audio signal includes 30 , . e . , *, , t . - * . ' . . 

, % t , . j a level of the noise added to the far-end audio signal is less 

speech of the near-end user; and , .,,11.,. . « 

. , „ J than an estimated-level of the noise present in the near-end 

modifying the noise added to the far-end audio signal environment 

when the near-end audio signal includes speech of the a hands&ee [d hQne &CCQtdiog [0 claim 19> wherem 

near-end user. -j • *■ *• • 1 j •* 

16. Amethod according to claim 15, wherein said step of 35 ™ d noise estimation processor includes a votce activity 
modifying the added noise includes the step of whitening the det f «** for determining whether the near-end audio signal 
added noise when the near-end audio signal includes speech mcludes s P eech oC the near - end user > and wh = rcln me noise 
of the near-end user added to the far-end audio signal is modified when the 

17. A method according to claim 15, wherein said step of near-end audio signal includes speech of the near-end user, 
modifying the added noise includes the step of increasing a 40 24. A handsfree telephone according to claim 23, wherein 
power level of the added noise when the near-end audio the noise added to the far-end audio signal is whitened when 
signal includes speech of the near-end user. the near-end audio signal includes speech of the near-end 

18. Amethod according to claim 11, wherein the filtering user. 

coefficients are dynamically adjusted according to a Least 25. A handsfree telephone according to claim 23, wherein 

Mean Squares (LMS) algorithm. 45 a power level of the noise added to the far-end audio signal 

19. A handsfree telephone, comprising: is increased when the near-end audio signal includes speech 
a near-end microphone receiving sound from a near-end of the near-end user. 

telephone environment and providing a near-end audio 

signal; ***** 
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[57] ABSTRACT 

A method and communication system (10), which can be 
used in a full, pseudo full, or half duplex speaker-phone 
system, includes a circuit (14) to remove a frequency 
component from a received signal to output a modified 
received signal, and a speaker (18) to convert the modified 
received signal to an acoustic signal (20). The circuit (14) 
may be, for example, a notch filter, which may remove a 
sharp frequency range between about 1250 Hz and 1550 Hz. 
A microphone (26) converts a second acoustic signal (24) to 
an electrical signal for transmitting, and a detector (30) 
produces a detector output if the removed frequency is 
present in the acoustic signal (24). A circuit (34) is also 
provided to modify a circuit parameter applied to the 
received signal when the detector (30) produces a detector 
output. The parameter may be, for instance an attenuation 
applied to the received signal. 

38 Claims, 3 Drawing Sheets 
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METHOD AND APPARATUS FOR four possible modes of speaker-phone operation: idle, far- 

DETECTING SPEECH AT A NEAR-END OF A end speech only, near-end speech only, and double talk, in 

COMMUNICATIONS SYSTEM, A SPEAKER- which both near-end and far-end speech occur simulta- 

PHONE SYSTEM, OR THE LIKE neously. The control logic module then uses operating mode 

BACKGROUND OF THE INVENTION 5 information to control the echo canceler circuit adaptation 

j r" u c *l t process and to switch losses into the loop, for example by 

1, Field of the Invention v . . - . , r . P , 1 
_ . . , . modifying the operation of the near-end and far end speech 
This invention relates to improvements in communication ■.»_.■•• n i * ♦ i *u n 

, . • i units to maontain an overall loop gain at less then 0 dB. 

systems, and more particularly to improvements in speaker- ...... „ 4 . , 4 . « a * ^ * 

v 4 j r ni *• i i . * * Additionally, the determinations generally are used to admst 

phone systems, and still more particularly to improvements . , , . J . , - , . , ,~ ' , . . * 4 

in methods and apparatuses for detecting the presence of 10 * e ^esholds of the received and transmitted signals, so that 

speech in a speaker-phone system, or the like. a volume of near-end audio is required to initiate 

„ „ , . i .i a particular transmission and attenuate the received signal. 

2. Relevant Background r „ . „ . - 
c . , . . , . Echo cancellation circuits generally operate in one or 
Speaker-phone systems in widespread use are systems by . _„ <, . , * 

■ . i . j * * j i • „ * i l. • » three modes. When near-end speech that exceeds a prede- 

which mdividuals communicate over telephone, intercom, 1C . " . , . , t , 5 , . 4 F . 

rtw . „ a ,_ r „- ,1 , ><u.^a<* f r ~* » 15 termined threshold is detected, one mode is to merely 

radio, or other transmission media, essentially hands tree. . .. , . , 

T \ • 1 . . . » • . continue the same transmitter and receiver parameter values 

In a typical speaker-phone system, both communication VTV 10 ^"". , K iaiu * ^ l *"™ 

. . ...... 1 . ■ .u j. as existed just before the time at which the near-end speech 

ends have a transmitter to translate a voice or other sound to , t ■ ' . - . , . . f. , 

, . ■„ j • * 1 * ■ 1 • ic. •• was detected. Another mode of operation is to selectively 

be transmitted into electrical signals for transmission, nor- ..... - a ■ 1 <r / *u ♦ * - * 

„ ■ . j ■ . . 1 * .u switch the received signals off and on, so that, for instance, 

mally using a microphone, and a receiver to translate the „ L , , h . , 4 4 , . / ' . .... 

■ j ■ 1 * * v * * 11 • 20 when near-end speech is detected, the transmitter is switched 

received signals into sound for listening, normally using a . . , , ' . . . , . 

A f „ ~f « L ta ' A nf 7. on and the receiver is switched off. This is probably the most 

speaker. As a matter of convention, a remote end 01 the ... ... * rn >. • . . r . • j • 

r A . c ... t L* ( r j»j 1 j widely used technique. The third mode, which is used in 

system is referred to herein as the far-end, and a close end J ,. „ , \ . . . . . 

• r j * « j jj more sophisticated full duplex systems, is to modify or 

is referred to as the near-end. . .. ■ , .« . j4 , 

, , , t apportion attenuation between the received and transmitted 

Although most speaker-phone systems use a half duplex M sj ^ to control ^ overaU b jn fo a constan , value> 

mode of operator., in wh.ch only one speaker can talk at any kss (han 0 dB> thereb avoidi ^ undesired oscillations or 

tune, some speaker-phone systems use either a full duplex or ^ desc[ihed above 

a pseudo-full duplex mode of operation, in which both ™ . . , * 

communication ends transmit and receive simultaneously. . ™ us > » operation, when a near-end and far-end connec- 

Howevcr, in full and pseudo full duplex systems if the 30 uon is estabtehed an automaUc miUahzaUon and calibraUon 

speaker and microphone are positioned too closely together of the near-end and far-end unite is generally performed. In 

or to common reflecting surfaces, such as walls, or the like, ^ initaltzation and cahbration procedure, die loop attenu- 

or if the speaker volume is set too high, a portion of the atl0n Jf ^ brated and <*f °P« ata g parameters .of the echo 

received signal is fed back into the transmitting path, often "ncelUdon circuits are determined and adjusted, depending 

causing unwanted echos of varying magnitude. In some 35 u P° n . ! he P artlcular environment* and background noise 

cases, the systems may oscillate or squeal, particularly if condiUons existmg at the time of calibration. However, even 

similar conditions exist at both the near and far ends of the ^ ben such c ! rcults > F™>«*t techniques, and echo cancel- 

lation procedures are used, the problems described above are 

m. • ' 1 1 1 1 ,j « . . r, not totally eliminated. For example, if one user suddenly 

This problem has been addressed in various ways. For . t i , r . . , - c t 

. y , A . J changes the volume of his speaker, or if a noise source, such 

example, most speaker-phone systems use an echo cancel- 40 c jj 1 * _* * 1 u ♦ ' »-n 

. A . r / l j * *L . * . ■ , . u as a ran, suddenly starts, most speaker-phone systems still 

lation circuit at each end of the system. A typical echo . ; , ' . . , r . .% J 

„ . - . ... , /K 4 have a tendency to squeal, at least until a new automatic 

cancellation unit, for instance at the near-end 01 the system, . v .. \ -.Z 1 , . c , 

. . . . -jr .ur j- initialization and calibration procedure can be re-periormed. 

processes a signal that has been received from the far-end in ...... . t 

a far-end speech module, which computes the power of the U can bc sccn that approach and its variants may lead 

received far-end signal before passing the received signal to 45 to ^correct decisions regarding the particular operating 

the speaker or audio circuitry. Usually, the far-end speech modf i™ w hich the speaker-phone system should be operat- 

module calculates the power over both short and long-term m f thresholds described above are commonly deter- 

time windows. The echo canceler unit. also receives the mulcd empirically, and are not robust to changes in cnvi- 

signal that is being transmitted from the near end, and ronmental conditions, often resulting in a poorly performing 

processes it in a near-end speech module, which computes 50 speaker-phone system. 

the power of the near-end signal before passing it to the Wnat i s needed is a method and apparatus to accurately 

transmitter circuitry. Usually, the near-end speech module detect tne presence of near-end speech to assure quality of 

also calculates the power over both short and long-term time transmission and to control feedback of the received far-end 

windows. signal. 

The power computations of the far and near end speech 55 SUMMARY OF THE INVENTION 
modules are then processed in a control logic module, 

which, depending upon the relative power ratios that are In light of the above, therefore, it is an object of the 

calculated, modifies the characteristics of line and acoustic invention to provide an improved communication system, 

echo cancellation circuits. For example, if the ratio of the It is another object of the invention to provide an 

short term power to long term power exceeds a predeter- 60 improved communication system of the type described that 

mined threshold, the control logic module determines that can be employed in an improved speaker-phone system, or 

far-end speech exists. If far-end speech is determined to be the like. 

present, a further comparison is made between the relative p It is another object of the invention to provide a method 

short term powers that are computed to determine if near- land apparatus to accurately detect the presence of near-end 

end speech also is present. 65 (speech to assure quality of transmission and to control 

Proper analysis of the far-end and near-end speech signals (feedback of the received far-end signal in a speaker-phone 

enables the control logic to accurately discriminate between Igystem, or the like. 
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It is still another object of the invention to provide a 
method and apparatus in which speaker-phone thresholds 
can be robustly changed in response to changes in environ- 
mental acoustic conditions in which the system is used. 

It is yet another object of the invention to provide an 
improved method and apparatus of the type described that 
can be used in conjunction with existing speaker-phone 
systems to improve the performance thereof. 

It is yet another object of the invention to provide an 
improved method and apparatus of the type described that 
can be used in conjunction with existing half, full, or pseudo 
full duplex speaker-phone systems, or the like. 

These and other objects, features, and advantages will 
become apparent to those skilled in the art from the follow- 
ing detailed description, when read in conjunction with the 
accompanying drawings and appended claims. 

According to a broad aspect of the invention, a speaker- 
phone system for communication between a near-end and a 
far-end is presented. The speaker-phone system includes an 
acoustic speaker at the near-end and a receiver for receiving 
a signal from the far-end for driving the acoustic speaker. A 
microphone copverts an acou stic signal at the near-end to a 
signal to be transmitted ( which may include a portion of the 
acoustic signal producedby the acoustic speaker), and a 
transmitter transmits tne signal to be transmitted to the 
far-end. A circuit provides a modified frequency content to 
the received signal in a frequency range that may be con- 
tained in the signal to be transmitted. 

The modified frequency content can be, for example, be 
accomplished by removal from the received signal of at least 
one frequency or range of frequencies that is contained in the 
signal to be transmitted. A detector is provided to detect the 
presence of the modified frequency content in the signal "fb 
be transmitted, which would indicate the existence of a 
near- end audio signal to be transmitte d. A circuit is con- 
nected to modify at least one parameter of the speaker-phone 
system when the detector detects the presence of the modi- 
fied frequency content in the signal to be transmitted. 

The circuit for modifying a frequency content of the 
received signal can be, for instance a notch filter or the like, 
which filters a band of frequencies within an audio range of, 
for instance, between about 1000 Hz and 1500 Hz, with a 
center frequency of about 1400 Hz. The detector can be a 
notch filter or a power detector to detect power within the 
filtered frequency range. 

According to still another broad aspect of the invention, 
a method for operating a speaker-phone system for commu- 
nication between a near-end and a far-end is presented. The 
method can preferably be used in conjunction with a 
speaker-phone system of the type which has an acoustic 
speaker at the near-end and a receiver for receiving a 
received signal from the far-end for driving the acoustic 
speaker. The speaker-phone system also include a micro- 
phone for converting an acoustic signal at the near-end to a 
converted signal to be transmitted and a transmitter for 
transmitting the converted signal to the far-end. The method 
includes modifying a frequency content of the received 
signal within a frequency range that is contained in the 
converted signal, then detecting the presence of the modified 
frequency content in the converted signal. The method also 
includes modifying at least one parameter of the speaker- 
phone system when the modified frequency content is 
detected in the converted signal. 

The step of modifying a frequency content of the received 
signal within a frequency range that is contained in the 
converted signal may include filtering the received signal to 
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remove from the received signal at least one frequency or a 
band of frequencies that is contained in the converted signal. 
The step of modifying at least one parameter of the speaker- 
phone system when the modified frequency content is 
S detected in the converted signal may include adjusting an 
attenuation of the received signal or adjusting parameters of 
line echo cancellation circuit and an acoustic echo cancel- 
lation circuit. 

According to yet another broad aspect of the invention, an 

]0 improved communication system is presented. The commu- 
nication system includes a circuit for removing a frequency 
component from a received signal to output a modified 
received signal, and a speaker for converting the modified 
received signal to an acoustic signal. The circuit may be, for 
example, a notch filter, which may remove a sharp frequency 

15 range between about 1250 Hz and 1550 Hz. A microphone 
converts a second acoustic signal to an electrical signal for 
transmitting, and a detector produces a detector output if the 
removed frequency is present in the acoustic signal. A circuit 
Tr^Tfr n-p nr'irirrl ffi Piojify rirm iit p n rnrn viHr HpftliVfl to the 

20 received signal when the detector produces a detector out- 
put. The parameter may be, for instance an attenuation 
applied to the received signal. 

According to yet another broad aspect of the invention, a 
method for processing communication signals is presented. 

25 The method includes removing a frequency component from 
a received signal to produce a modified received signal and 
converting the modified received signal to a first acoustic 
signal. The method also includes converting a second acous- 
tic signal to an electrical signal for transmitting, the second 

30 acoustic signal possibly containing a portion of the first 
acoustic signal, and modifying circuit parameters applied to 
the received signal if the frequency component is present in 
the acoustic signal. 

BRIEF DESCRIPTION OF THE DRAWINGS 

35 The invention is illustrated in the accompanying 

drawings, in which: 

FIG. 1 is a block diagram showing the theory of operation 

of a system for detecting near-end speech in a communica- 
40 tion system, in accordance with a preferred embodiment of 

the invention. 

FIG. 2 is a block diagram of a system for detecting 
near-end speech that may be implemented with software, in 
accordance with a preferred embodiment of the invention. 
45 FIG. 3 is a graph of attenuation vs frequency of a typical 
filter that may be used to filter a received far-end signal, in 
accordance with a preferred embodiment of the invention, 
and which is generated by the computer program of TABLE 
1. 

50 FIG. 4 is a graph of attenuation vs frequency of a typical 
detector that may be used to detect a transmitted near-end 
signal, in accordance with a preferred embodiment of the 
invention, and which is generated by the computer program 
of TABLE 1. 

ss FIG. 5 is a graph of amplitude vs time for a typical 
near-end speech pattern. 

And FIG. 6 is a graph of the magnitude of a detector 
output vs time for the speech pattern of FIG. 4, generated by 
the computer program of TABLE 1, in accordance with a 
60 preferred embodiment of the invention. 

In the various figures of the drawings, like reference 
numerals are used to denote like or similar parts. 

DETAILED DESCRIPTION OF THE 
65 PREFERRED EMBODIMENTS 

A block diagram of a system 10 for detecting near-end 
speech in a communication system illustrating the theory of 
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operation of a preferred embodiment of the invention is 
shown in FIG. 1. The system 10 can be used in a full, pseudo 
full, or half duplex speaker-phone or other communication 
system, which receives an incoming or received signal on 
line 12, which may be, for example, from a telephone wire, 
communication link, or other source. The received signal on 
line 12 is filtered by a filter circuit 14 to remove a frequency 
component from the received signal to output a modified 
received signal to an amplifier 16. The circuit 14 may be, for 
example, a notch filter, which may remove a sharp frequency 
range centered about, for example 1400 Hz, between about 
1250 Hz and 1550 Hz. As will become apparent, the 
particular frequency that is removed is not critical, but 
preferably should be within a frequency range that is gen- 
erated by an acoustic signal that will be transmitted as an 
outgoing communication signal, described next below. A 
speaker 18 receives the output from the amplifier 16, and 
converts the modified received signal to an acoustic signal 
20. 

The acoustic signal may traverse various paths, one of 
which may be directed to a microphone 26, which is 
intended to receivejjocal acousticj ignal 23, for example, 

iown), for outgoing 



from a local individual or speaker (not 
transmission. The microphone 26 converts the local acoustic 
signal 23 and any portion of the reflected received acoustic 
signal 24 to^afi^leciricahsignal^ox^ujgoing transmission, 
which is amplified by an amplifier 28, and~applied~to- 
tctor 30. 

The detector 30 detects (he presence of a signal that is 
within the frequency range that was removed from the 
incoming signal by the filter 14, and operates to produces a 
detector output if the removed frequency is found to be. 
present in the acoustic signal 24. The detector 30 may 
contain, for example, a band pass filter designed to pass 
signals at the frequency removed by the notch filter 14, such 
as, in the example described, between about 1250 Hz and 
1550 Hz, at a center frequency of about 1400 Hz. 
Alternately, the detector 30 may include a power detector 
that detects the existence of signal power in the frequency 
ranges of interest. Although the detector 30 is shown in a 
series path between the amplifier 28 and the output line 32, 
it will be appreciated that its function is merely to detect the 
presence of the removed frequency; consequently, it can 
easily be realized by merely sampling the output signal from 
the amplifier 28 

A circuit 34 receives the outp ut from the detector 3< 
conTrftK ane or rqpre ^ paramCTers' of the circuit 10. For 
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example, _the circuit 34 may be us ed to adjust the gain or 
a ttenu ation appli ed by the amplifier l b to the 1 Input Ur 
"received signal rrom line 1.2. In some embodiments, for 50 
example, in which half duplex operation is used, the circuit 
34 may control a switch (not shown) to completely turn off 
jhe incoming received signal when n e ar-end speech^ is 
^ieiectea to exist. Additionally, the amplifier may be 
connected, as shown , to the amplifier ffi to control the flain^ ** 
applied to the converted acoustic signal to be transmitted. 



A block diagram of a speaker-phone system 50, employ- 
ing software controlled circuitry for providing and detecting 
a frequency modification of the received signal, in accor- 
dance with another preferred embodiment of the invention, 
is shown in FIG. 2, to which reference is now made. The 
speaker-phone system 50 is used to communicate between a 
speaker 42 at a far-end 43 and a speaker 45 at a near-end 46, 
and includes a echo cancellation unit 51 to monitor the 
incoming and outgoing signals, as described below. It will 
be understood that although a single echo cancellation unit 
51 is described with respect to the near-end 46 of the 
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speaker-phone system 50, a similar echo cancellation unit 
(not shown) may be used in conjunction with the speaker- 
phone system at the far-end 53. 

Since most speaker-phone systems are essentially 
telephonic, the far-end equipment is generally fortuitously 
configured in whatever manner in which it may be found, 
and may be beyond the control of a near-end user. 
Consequently, the duplication of the near-end system at the 
far-end, although preferable, is not at all necessary. It should 
also be noted that although the speaker-phone system 50 is 
described in conjunction with actual human speakers at the 
far-end 43 and near-end 46, the principles of the invention 
apply equally well to acoustic systems in general, for 
example, where one or the other users are non-human, such 
as a computer, a synthesized voice system, or the like, or in 
noise monitoring systems, such as so-called "baby 
monitors " or other such systems. 

The echo cancellation unit 51 includes a far-end speech 
analysis and modification circuit 55 which receives a signal 
on the near-end receiving line 68, which is amplifie d by a 
receiver amplifier 69. The far-end 43 circuitry is represented 
by a box 72 labeled "hybrid," which can be constructed in 
a desired manner to output audio signals on line 68, and 
receive input audio signals on line 70 from the near-end 46. 
As mentioned, the hybrid circuit 72 may be constructed 
similarly to the echo canceler unit 51 at the near-end, but this 
is not a critical requirement. The output from the far-end 
speech analysis and modification circuit 55 is amplified by 
a power amplifier 73 at the near-end 46, and applied to an 
icoustic speaker 75. 

\a microphone 80 is provided to receive the acoustic 
signal, or voice, from the near-end speaker 45. The low level/ 
electrical output from the microphone 80 is amplified by 
preamplifier 78 and conducted to a near-end speech analysi^ 
andunodification circuit 56, the output of which is amplified) 
by an output amplifier 71 for delivery to the far-end on fin 
70. 

T)^pically, a certain amount of the acoustic signal 76 
generated by the speaker 45 is directed or redirected into the 
microphone 80. This may be due, for example, to reflections 
from nearby walls or surfaces, or perhaps from a direct path 
of travel between the speaker 75 and microphone 80. To 
rejmice t he effects of such fed-back acoustic signal energy, an 
udio echo cancellation circuit 62 Is providea. ine~'echo 
cancellation circuit 62 receives a portion or sample of the* 
output of from the far-end speech analysis and modification 
circuit 55, weights it, and subtracts it from the signal from 
the preamp lifier 78 in a summer circuit 63 , before the 
outgoing near-end signal is processed by the near-end 
speech analysis and modification circuit 56. Similarly, a fine 
echo cancellation circuit 63 is provided that samples a 
portion of the output signal from the near-end speech 
analysis and modification circuit 56, weights it, and sub- 
tracts it from the incoming signal from amplifier 69 by a 
second summer circuit 61 before the incoming signal is 
applied to the far-end speech analysis and modification 
circuit 55. 

The far-end speech analysis and modification circuit 55. 
the near-end speech analysis and modification circuit 56, the 
line echo cancellation circuit 63, and the acoustic echo 
cancellation circuit 62 are all controlled by a control logic 
section 58, which may conveniently be a digital signal 
processor, such as a TMS320C25, sold by Texas Instruments 
Incorporated, which may be programmed by a suitable 
digital computer program. (In fact all of the elements in 
block 51, including the echo cancellers 61 and 62, may be 
implemented by the digital signal processor.) 
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More particularly, the far-end speech analysis and modi- 
fication circuit 55 can include a bank of digital notch filters, 
each individually tuned to a different frequency. The near- 
end speech analysis and modification circuit may be a bank 
of power measuring circuits, each for measuring the power 
at the same frequency bands corresponding to those pro- 
vided by the notch filters of the far-end speech analysis and 
modification circuit 55. The power measuring circuits, 
moreover, can measure the power over both short and long 
term time windows. If the short term power exceeds the long 
term power at the frequencies of interest, then it follows that 
near-end speech must be present because the outgoing 
far-end speech did not contain any power in those frequency 
bands. 

In addition, one or more of the digital notch filters that are 
provided within the far-end speech analysis and modifica- 
tion circuit 55 remove energy at a predetermined frequency, 
or set of frequencies. These removed frequencies are pref- 
erably in the range of frequencies that would be generated by 
whatever acoustic source that is generating the near-end 
acoustic signal. For example, if the near-end speaker is a 
human voice, the frequency band can be in the range of 1250 
Hz to 1550 Hz, with a center frequency of about 1400 Hz. 
Of course any convenient frequency band or bands can be 
used, the choice of which not being critical. 

Trie near-end speech analysis and modification circuit 56 
also has, in addition to its normal power measuring circuits, 
one or more circuits to detect the presence of a frequency 
content in those frequency bands removed by the digital 
notch filters that are provided within the far-end speech 
analysis and modification circuit 55. This is done preferably 
by a power detector, although other detection techniques 
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may be used. The control logic section 58 then determines 
that near-end speech exists if the energy in the power 
measuring circuits of the near-end speech analysis and 
modification circuit 56 in the frequency bands removed by 
the digital notch filters that are provided within the far-end 
speech analysis and modification circuit 55 exceeds a noise 
floor energy, as measured by the long term power levels. If 
near-end speech is determined to exist, any number of 
actions can be taken. For example, the attenuation of the 
received signal on line 68 can be attenuated, or turned off 
completely, and the outgoing signal from the preamplifier 78 
further boosted. Or the relative gains and attenuations of the 
transmitted and received signals can be continuously 
adjusted to vary the loop gain of the circuit to maintain it at 
a constant level less than 0 dB. This may be particularly 
useful in full duplex systems. Alternatively, the parameters 
of the line and acoustic echo cancellation circuits 62 and 63 
can be modified to increase or decrease the signal levels that 
are subtracted from the received and transmitted signals to 
provide maximum echo cancellation. Other appropriate 
parameter modifications will be apparent to those skilled in 
the art. 

An example of a program (MATLAB M-file) that can 
remove frequency content from the far-end speech signal 
and test for the presence of the removed frequencies in the 
signal transmitted from the near-end is set forth below in 
TABLE 1. It is noted that in the code' of TABLE 1, the 
far-end noise is simulated with random noise, and the 
near-end signal is simulated with an assumed power and 
signal to noise ratio, and with an assumption that the 
near-end speaker is silent during the first 1 second of the 
conversation. 



TABLE 1 



% initialize the random number generator 

rand( 4 seeo", sum(clock)*10G); randn('seed', sum(clock)*100); 
% far-end speech signal (w/additive wgn) 

% this signal stimulates the speaker on the far-end and the noise is added 
% to simulate a typical signal to noise ratio of a phone line (e.g. 40 db) 
% (The file 4 c:\speech\dam_f,spd'*is a 16-bit signed integer format 
% file that contains sample speech data) 
phones nr=40; 

fid l=f open ('c:\speech\dam f.spd'.V); 

fes=frcad(fidl, 'mtl6'); fes=fes(:)'; fes=fes/32768; sigpow=sum(fes. A 2); 

sig_noise=randn(size(fes)); noisepow=sig_noise. 2); 

fes«fes+sqrt(s igpow/(l 0 (phonesnr/1 0) * no isepow)) *sig__noise; 
% the following simulates the idea of removing frequency content from the 
% outgoing signal, tn this simulation, a single notch filter is used; however, 
% other filters or combination of filters can be used for the same purpose. 
% example notch filter designed to remove frequency content in a neighborhood 
% around 1400 Hz. 

fs=8000; nfreq=1400; w0-2*pi*nfreq/£s; 

r-0.97; k2-l-r 2; kl-2*sqrt(l-k2/2)*sin(w0/2); 

b-((2-k2)/2)*[l -2*(2-k2-kl *2)/(2-k2) Ij 

a«fl -(2-k2-kl"2) (l-k2)J 
% plot and print frequency response 

clg: 

[h J pMreqz(b,a,512); 
H-dBCabs^lOOjO); 
v-[0:511]«4000/511; 
plot(v,H); 

ylabcl(* Magnitude Response (dB)'); 
xlabel(' Frequency (Hz)'); 

title(' Example Notch Filter Frequency Response'); 
pause 

% frequency content modification of outgoing signal via notch filtering 

fes_mod -fUter(b } a,fe3) ; 
% acoustic echo path 

% the following represents a typical acoustic echo path as described 

% in the literature (e.g. Perez, et al.; 'Acoustic . . . 'IEICE Trans. Vol. 24 1991) 
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TABLE 1 -continued 



% Note: there is no loss of generality in using this particular echo path. 

M»32; % time duration of echo path (in 8 KHz samples) 

pathgain—6; % acoustic echo path gain 

hcl-(raDd(l,M> :; 0^).*(10. (-6*[0:M-1^M)); 

hcl»hcl*sqrt(10"(pathgaifl/10)/suin(hcl. 2)); hcl«hcl(:)'; 
% create returned signal which is composite of: 
% 1) far-end speech returned signal 
% 2) near-end speaker, and 
% 3) environmental noise 

N-3"8000; % number of samples used in simulation 

% far-end speech returned signal 

% signal is assumed to be a convolution of echo path and far-cod signal 

fes_ret»conv(hcl,fes^nod(M+l :N)); 
% near-end speaker 

% this signal represents the near-end speaker who would be located in the 
% room (acoustic chamber) and speaking into the microphone. 

% (The fUe 'c:\speech\dam m.spd' is a 1 6-bit signed integer format 

% file that contains sample speech data) 

fid2-fopen('c;\speech\dam_mipd'/r') 

nes»fread(fid2, *intl6*); nes=nes(:) nes=*nes/32768; 

nes»nes(l :max(size(fes_ret))); 
% set the first 8000 samples to zero to simulate that the neai-end speaker 
% is silent during the first 1.0 seconds of the conversation 

nes(l :8000)-zeros(l ,8000); 

sp__ret-fes_ret+nes; 
% add wgn to achieve a given snr 

invsnr-15; 

sigpow-sum(sp_ret. 2); 

sig noise-randn(size(sp_ret)); noisepow-sum(sig noise. 2); 

sp ret-sp_ret+sqrt(sigpow/(l 0 (envsnr/1 0) * noisepow)) *sig__noise; 

% near-end speech detector 

% shorttenn (12 ms) power window constants 

shortwin-12* 1 0 "(-3)/(125 *10 " (- 6)); sbeta-2* (- 15/shortwin); 

longwin«100*10*(-3)/(125*10*(-6)); lbeta=2*(-l5/longwin); 
% filter returned signal to isolate frequency bands of interest and compute 
% the short-term power sequence associated with those bands to simulate the 
% section of the detector that analyzed frequency bands of interest and 
% computes long and short term power estimates 
% detection filter specifications 

fs=8000; nfreq-1400; w0-2*pi*nfreq/£s; 

r=0.99; k2»l-r*2; kl»2*sqrt(l-k2/2)*sin(w0/2); 

b«(-k2/2)*[l 0 -1]; 

a=[l -(2-k2-kl A 2) (l-k2)]; 
%plot and print frequency response 

[h,p>freqz(b,a,512); 

H«dB(abs(h),100,0); 

v-(0:5n]'4000/511;. 

plot(v,H); 

y label ('Magnitude Response (dB)*); 
xlabel(* Frequency (Hz)'); 

title(' Example Detection Filter Frequency Response'); 

pause 
% filter signal 

temp =filter(b ,a^p_ret); 

short .jJowafilterCfl-sbetaJ^l-sbetaJjlemp.^); 

long_pow«filler((l -Ibeta) \ 1 -lbeta],temp. *2); 
% calculate ambient noise floor as the short term power that we 
% measure during the initialization period 

nesp^hort_po w>— 2. 5 * max (1 ong_pow(l : 4000)); 
% plot signals 

clg; 

subp!ot(211), plot(nes); 

title(' Near-End Speech Signal*) 

subplot(212), plbt(nesp); 

title(* Near-End Speech Detector output') 

print 



The attenuation of the received far-end signal produced likely be contained in a near-end speech signal, but since the 

by the filler of the program of TABLE 1 is shown in FIG. 3. 60 30 o Hz bandwith is relatively narrow, its attenuation or 

Design techniques to realize a software or hardware version remova] from ^ feceived si d would m{ si 

of a filter having similar notch characteristics as those , , , „ f , • , , 

described are well known, and can be readily accomplished canUv reduce ^ 0VeraU 9 uaht y of received far " end 

by those skilled in the art. It can be seen that the filter signal. 

produced is a notch filter, having a center frequency at about _ . . , , 

1400 Hz. The filter characteristics are fairly sharp, attenu- 65 The frequency response of the frequency detector is 

ating a frequency range between about 1250 Hz and 1550 shown in FIG. 4, and has a similarly sharp frequency 

Hz. This particular frequency range is one that would most detection response centered about 1400 Hz. It can be seen 
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that the detector has a relatively sharp response, falling off 4. The speaker-phone system of claim 3 wherein said 

about 25 db at about ±150 Hz from the center frequency. detector is a notch filter. 

A typical near-end speech signal is shown in FIG. 4, 5 - ^ speaker-phone system of claim 3 wherein" said 

which might be generated by a speaker at the near-end. The notch filter filters frequencies between about 1250 Hz and 

near-end speech signal in the computer code generated 5 1^50 Hz. 

above in TABLE 1 is separately inserted from a disk file, but 6 - The speaker-phone system of claim 4 wherein said filter 

contains a frequency component at about 1400 Hz. has a cemer frequency of about 1400 Hz, 

The output from the detector simulated by the code of 7 " ^ c speaker-phone system of claim 1 wherein said 

TABLE 1 in response to the near-end speech signal of FIG. c ° nt '° 1 emprises a circuit to adjust an attenuaUon of 

4 is shown in FIG. 5. It can be seen that the constant 10 ^e first signal 

amplitude outputs generated correspond to the discrete seg- 8 * ™ e speaker-phone system of claim 1 wherein said 

ments of the speech signal. control circuit comprises a circuit to adjust parameters of a 

' ^ A ^ r „ line echo cancellation circuit and an acoustic echo cancel - 

The computer program of TABLE 1 is primarily intended i au0 n circuit 

to demonstrate the theory of operation of the invention. Tte J5 „ ^ s eaker hone s slem of claim l wherein ^ first 

outputs are provided in the form of printed plots or graphs circuit for providing a ^own frequency content uses fre- 

of the detector for detecting the presence of frequencies that domain oper4tions . 

are expected to be contained in the near-end speech. 10 ^ speak hone s ^ tem of claim x wherein me first 

However, one skilled in the art would find connection of the circuit for providing a known f reqU6 „ C y content provides a 

output from the detector to control parameters of the com- M steady state response 

munition circuit embodying the invention readily appar- ' u ^ sp6aker . pnone systcm of claim 10 whc rein the 

en *' response is a steady-state frequency response. 

Although the invention has been described and illustrated 12. A speaker-phone system for communication between 

with a certain degree of particularity, it is understood that the a near-end terminal and a far-end terminal, comprising: 

present disclosure has been made only by way of example, u afl acousUc kef at ^ neaf _ end Xm ^. 

and that numerous changes in the combination and arrange- . - .. « . • , f * u *- j 

m . r „ nn . * nriaA u„ *u™ ^UoA ■« *u a a receiver for receiving a first signal from the far-end 

ment or parts can be resorted to by those skilled in the art 4 , f , . . f, * . 

* *i« * j — * * p * • ■% . terminal for dnving the acousUc speaker; 

without departing from the spirit and scope of the invention, . B F 

as hereinafter claimed. a nnciophone for converting an acoustic signal at the 

What is claimed is* near-end terminal to a second signal to be transmitted, 

1. A speaker-phone system for communication between a 3 ° microphone being acoustically coupled to the 
near-end terminal and a far-end terminal, comprising: acoustic s P eaker at the near-end terminal; 

an acoustic speaker at the near-end terminal; a transmitter for transmitting said second signal to be 

, „ , , transmitted to the far-end terminal; 

a receiver for receiving a first signal from the far-end rAl £ - t . £ t , . . A , 

* 1 r j ■ • !l 1 i< a filter tor removing from the first, received signal at least 

terminal tor dnving the acoustic speaker; c *u * • * • j * j j • i* 

* r one frequency that is contained in said second signal to 

a microphone for converting an acoustic signal at the ^ e transmitted- 

near-end terminal to a second signal to be transmitted, , detector tQ ^ the ^ of said A , east one 

the microphone being acoustically coupled to the frequency in said second signal to be transmitted; and 

acoustic speaker at the near-end terminal; t . . , . 4 ... 4 , 

. r e . . . , 40 a control circuit connected to modify at least one param- 

a transmitter for transmitting said second signal to be eter of the speaker . phone system when said detector 

transmitted to the far-end terminal; dctects ^ presence of said at lcast orlc frequency in 

a first circuit for providing a fixed frequency content to sa i d second signal to be transmitted. 

said first, received signal in a frequency range that may 13. The speaker-phone system of claim 12 wherein said 

be contained in said signal to be transmitted, the first 45 filter is a notch filter, 

circuit including a filter for removing from the first, 14 The speaker-phone system of claim 13 wherein said 

received signal at least one frequency that is contained detector is a notch filter. 

in the signal received from the far-end terminal to 15. The speaker-phone system of claim 14 wherein said 

provide the fixed frequency content; notc h filter filters frequencies between about 1250 Hz and 

a detector to detect the presence of a modification of said 50 1550 Hz. 

known frequency content in said second signal to be 16. The speaker-phone system of claim 14 wherein said 

transmitted, the detector including a power detector for filter has a center frequency of about 1400 Hz. 

detecting signal power at the at least one frequency 17. The speaker-phone system of claim 12 wherein said 

removed from the first, received signal; and detector comprises a band pass filter. 

a control circuit connected to the first circuit and the 55 18. The speaker-phone system of claim 12 wherein said 

detector to modify at least one parameter of the detector comprises a power detector for detecting signal 

speaker-phone system when said detector detects a long power at a frequency of said filter. 

term power level greater than a noise floor in the second 19. The speaker-phone system of claim 12 wherein said 

signal to be transmitted at the at least one frequency control circuit comprises a circuit to adjust an attenuation of 

removed from the first, received signal. 60 the signal from the far-end. 

2. The speaker-phone system of claim 1 wherein said first 20. The speaker-phone system of claim 12 wherein said 
circuit for providing a fixed frequency content to said first control circuit comprises a circuit to adjust parameters of a 
received signal comprises a filter for removing from the first, line echo cancellation circuit and an acoustic echo cancel- 
received signal at least one frequency that is contained in lation circuit. 

said second signal to be transmitted. 65 21. The speaker-phone system of claim 12 further com- 

3. The speaker-phone system of claim 2 wherein said filter prising: 

is a notch filter. . a second acoustic speaker at the far-end terminal; 
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a second receiver for receiving the second signal from the 
near-end terminal for driving the second acoustic 
speaker at the far-end terminal; 

a second microphone for converting an acoustic signal at 
the far-end terminal to the first signal to be transmitted 
to the near-end terminal; 

a second transmitter for transmitting said first signal at the 
far-end terminal to the near-end terminal; 

a second filter for removing from the second signal 
received from the near-end terminal at least one fre- 
quency that is contained in said first signal to be 
transmitted; 

a second detector to detect the presence of said at least one 

frequency in said first signal to be transmitted; 
and a control connected to modify at least one parameter 
of the speaker-phone system at said far-end terminal 
when said second detector detects the presence of said 
at least one frequency in said first signal to be trans- 
mitted from said far-end terminal. 
22. A method for operating a speaker-phone system for 
communication between a near-end terminal and a far-end 
terminal, said speaker-phone system having an acoustic 
speaker at the near-end terminal, a receiver for receiving a 
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28. The method of claim 22 wherein said step of removing 
comprises filtering a band of frequencies from said received 
signal that has a center filtered frequency of about 1400 Hz. 

29. A communication system, comprising: 

a circuit for removing a frequency component from a 

received signal to output a modified received signal; 
a speaker for converting the modified received signal to 

an acoustic signal; 
a microphone, acoustically coupled to the speaker, for 

converting an acoustic signal to an electrical signal for 

transmitting; 

a detector for producing a detector output if said removed 
frequency is present in said acoustic signal; and 

a circuit for modifying a circuit parameter applied to said 
received signal when said detector produces a detector 
output. 

30. The communication system of claim 29 wherein said 
circuit for removing a frequency component is a notch filter. 

31. The communication system of claim 30 wherein said 
notch filter has a sharp pass band between about 1250 Hz 
and 1550 Hz. 



32. The communication system of claim 30 wherein said 
first, received signal from the far-end terminal for driving 25 notch filter has a center frequency of about 1400 Hz. 
the acoustic speaker, a microphone for converting an acous- 33. The communication system of claim 30 wherein said 
tic signal at the near-end to a converted signal to be detector has a sharp detection frequency band approximately 
transmitted, the microphone being acoustically coupled to corresponding to the pass band of said notch filter, 
the speaker at the near-end terminal, and a transmitter for 34 Thc commumca tion system of claim 29 wherein said 

circuit for modifying a circuit parameter comprises a circuit 
to modify an attenuation applied to said received signal. 



transmitting said converted signal to the far-end terminal, 
comprising: 

removing a known content from said first, received signal 
within a frequency range that is contained in said 
converted signal; 

detecting the presence of a change to said known content 
in said converted signal; 

and modifying at least one parameter of the speaker- 
phone system when said change to said known content 
is detected in said converted signal. 

23. The method of claim 22 wherein said step of removing 
comprises filtering said first, received signal to remove from 
the received signal at least one frequency that is contained 
in said converted signal. 

24. The method of claim 22 wherein said step of removing 
comprises filtering a band of frequencies from said received 
signal to remove from the received signal a band of fre- 
quencies that is contained in said converted signal. 

25. The method of claim 24 wherein said step of modi- 
fying comprises adjusting an attenuation of the received 
signal. 

26. The method of claim 24 wherein said step of modi- 
fying comprises adjusting parameters of a line echo cancel- 
lation circuit and an acoustic echo cancellation circuit. . 

27. The method of claim 22 wherein said step of removing 
comprises filtering a band of frequencies between about 
1250 Hz and 1550 Hz from said received signal. 



30 



35 



35. A method for processing communication signals, 
comprising: 

removing a frequency component from a received signal 

to produce a modified received signal; 
converting the modified received signal to a first acoustic 
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converting a second acoustic signal to an electrical signal 

for transmitting, said second acoustic signal containing 

said first acoustic signal; 
and modifying circuit parameters applied to said received 

signal if said frequency component is present in said 

acoustic signal. 

36. The method of claim 35 wherein said step of removing 
a frequency component from a received signal comprises 
filtering a frequency band. 

37. The method of claim 36 wherein said step of filtering 
a frequency band comprises filtering a frequency band that 
is expected to be contained in said acoustic signal. 

38. The method of claim 35 wherein said step of modi- 
fying circuit parameters applied to said received signal 
comprises detecting whether said frequency component 
exists in said acoustic signal. 
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